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Abstract 


Estimation of design flood at a desired location on a river is important for planning, design 
and management of hydraulic structures. This can be carried out through frequency analysis 
by fitting of probability distribution to the series of recorded annual Peak Flood Discharge 
(PFD) data. In this paper, Extreme Value Type-1 (EV1) and Log Pearson Type-3 
distributions are adopted for frequency analysis of annual PFD data of Giri and Jalal rivers. 
Maximum likelihood method is used for determination of parameters of the distributions. 
Goodness-of-Fit tests viz., Anderson-Darling and Chi-square are applied for checking the 
adequacy of fitting of probability distributions to the recorded data. Diagnostic test, viz., 
D-index is used for the selection of most suitable distribution for estimation of PFD. The GoF 
test results and D-index values show the estimated PFD from LP3 distribution could be used 
for design of hydraulic structures in Giri and Jalal rivers and design of flood in river 


protection works of Yamuna river basin. 


Keywords: Anderson-Darling, Chi-square, Design flood, D-index, Frequency, 
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Introduction 


Estimation of design flood at a desired location on a river is required for effective flood-plain 
management, and for efficient design of attenuation storages, bridges, culverts, 
embankments, and flood-protection structures. A design flood is used in comprehensive flood 
management to assess the Peak Flood Discharge (PFD) and to protect human lives and 


properties within a watershed (Stedinger and Griffis, 2012). In hydrological context, 


estimation of PFD can be carried out by two different approaches: (1) statistical approach by 
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applying probability distribution to historical flow data to determine the design flood; and (ii) 
the design storm method uses the rainfall-runoff model and consider the rainfall quantile 
determined by frequency analysis as the input data (Rogger et al., 2012). Statistical approach 
using recorded annual PFD data is considered the standard method for estimation of flood 


quantile for a given return period and hence used in the present study. 


A number of probability distributions such as Extreme Value Type-1 (EV1), Generalized 
Extreme Value (GEV), 2-parameter Log Normal (LN2), Log Pearson Type-3 (LP3) and 
Normal are widely applied for frequency analysis of stream flow data (Saf, 2007; Casas et al., 
2011; Lee et al, 2012; Daneshfaraz et al., 2013). Olumide et al. (2013) applied normal and 
EV1 distributions for prediction of rainfall and runoff at Tagwai dam site in Minna, Nigeria. 
They have also expressed that the normal distribution is better suited for rainfall prediction 
while Log-Gumbel for runoff. Izinyon and Ajumuka (2013) carried out Flood Frequency 
Analysis (FFA) for three tributaries of upper Benue river basin, Nigeria adopting EV1, LN2 
and LP3 distributions. Das and Qureshi (2014) evaluated the probability distributions of 
GEV, LN2 and LP3 adopted in FFA through D-index and found that the LP3 is better suited 
distribution for estimation of PFD for Jiya Dhol river basin. Rasel and Hossain (2015) 
applied EV1 distribution for development of intensity duration frequency curves for seven 
divisions in Bangladesh. In view of the above, EV1 and LP3 distributions are used in the 
present study. Parameters of the distributions are determined by Maximum Likelihood 
Method (MLM) and used to estimate the PFD. For quantitative assessment on PFD data 
within the recorded range, Goodness-of-Fit (GoF) tests such as Anderson-Darling (A’) and 
Chi-square (°) are applied. Diagnostic test, viz., D-index is used for the selection of most 
suitable distribution for estimation of PFD. The methodology adopted in frequency analysis 
of annual PFD data through GoF tests and diagnostic index are briefly described in the 


following sections. 
Methodology 
The study is to estimate design flood of Giri and Jalal rivers of Yamuna River Basin (YRB). 


Thus, it is required to process and validate the data series of annual PFD such as (i) determine 


the parameters of EV1 and LP3 distributions (using MLM); (ii) assess the adequacy of fitting 


of EV1 and LP3 distributions to the series of annual PFD using GoF tests; (iii) selection of 
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most suitable probability distribution for estimation of PFD using D-index and (iv) analyse 


the results obtained thereof. 
PDF and CDF of Probability Distributions 


The Probability Density Function (PDF; f(X)) and Cumulative Distribution Function (CDF; 
F(X)) of EV1 distribution is given as below: 
e (X-a)/B,-e OP 


f (X) =—______ 
(X) B (1) 


FX) =e" p>0 
where, a and ß are the location and scale parameters of the distribution (Gumbel, 1960). The 
parameters are computed by MLM through Equations (2) and (3), and used to estimate the 


PFD (Xv) for different return periods from X, = a+ Y,ß, where Y, =—In(-In(1-(1/T))). 


a= -Bin Sexp(- x,/B)/\N| is (2) 
p=X-[3x,exn(-X,/6)/Zexo(-X,/6)] are) 
SE(X,) =(6/VN \1.15894+0.19187Y, +1.1¥2)" (4) 


where, X; is the recorded PFD of jt sample, X is the average value of PFD and SE(X7) is the 


standard error of the estimated PFD (Xr) for a given return period (T). 


The PDF and CDF of Log Pearson Type-3 (LP3) distribution is given by: 


y-1 _{ InX)-a 
£(X:0,8,7) =—! [m=] (5), B,y>0 and F(R)= ff(X;0.B,y)dX . (5) 
PXTYy B 


where, a, B and y are the location, scale and shape parameters of the LP3 distribution (Rao 
and Hameed 2000). The parameters are computed by MLM through Equation (6) and used to 
estimate the PFD (Xr) for different return periods from X+ = Exp((a+ By) + K,Bva). Here, 
K, is the frequency factor corresponding to the probability of exceedance and coefficient of 


skewness based on the log transformed series of the recorded data for LP3. 
N 
> (In(X;)— a) = NBy 


i=l 


Ny(y) = En [X, )—a) /B] 6 





N= Boy DE [AMX )—a)] 
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where, w(y)=I"'(y)/T(y)is called a digamma function. The SE of estimated PFD adopting 
LP3 distribution (using MLM) is computed from Equation (7) and given by: 
afac aK (C3 
: Kef% Js up SES sc) (7) 


SEX) = 7 J 
g Æ 243C2 +50, +208 
aC, 9 











where, OK .{ Z’-1), a(z? -62) az? -1) 2 „4Z 3 10.4 and Z is the standard normal variate. 
ac, | 6 6 SO ee ae 


The lower and upper confidence limits (LCL and UCL) of the estimated PFD value at 95% 
level are obtained from LCL=X7-1.96(SE(X7)) and UCL= X7+1.96(SE(X7)). 


Goodness-of-Fit Tests 


GoF tests are essential for checking the adequacy of probability distribution to the recorded 
series of annual PFD for estimation of PFD. Out of a number GoF tests available, the widely 
accepted GoF tests are A? and y”, which are used in the study. The theoretical descriptions of 


GoF tests statistic (Charles Annis, 2009) are as follows: 


A’ statistic: 


A? =(-N)-(1/N) $ {2i -1) In(Z,) + (2N+1-2i) nd -Z,)} ... (8) 





Here, Z, =F(X,) for 1=1,2,3,...,N with X1<Xo<....<Xy, F(X;) is the CDF of it sample (X; ) 


and N is the sample size. 


y” statistic: 


, _xc(0,(X)-E,(x))’ 
jel E(X) 





.. (9) 


where, O,(X) is the observed frequency value of j” class, E,(X) is the expected frequency 


value of j” class and NC is the number of frequency class (Zhang, 2002). The rejection 


region of vA statistic at the desired significance level (n) is given by x2 = Le aici Here, m 


denotes the number of parameters of the distribution and y2 is the computed value of x? 


statistic by PDF. 
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Test criteria: If the computed values of GoF tests statistic given by the distribution are less 
than that of the theoretical values at the desired significance level then the distribution is 


found to be acceptable for modelling the series of annual PFD data. 
Diagnostic Test 


Diagnostic test, viz., D-index is used for the selection of most suitable probability distribution 


for estimation of PFD (USWRC, 1981), which is defined by: 


RMSE=— 5 \x, -x| ... (10) 


N i=! 





Here, X is the average value of the recorded PFD data whereas X; and Xf are the six 


highest recorded and corresponding estimated PFD values by PDF. The distribution having 


the least D-index is considered as better suited distribution for estimation of PFD. 
Application 

In this paper, an attempt has been made to estimate the PFD for different return periods for 
Giri and Jalal rivers, which are the tributaries of YRB. The series of annual PFD of Giri river 
for the period 1991 to 2009 and Jalal river for the period 1993 to 2009 is used. Table 1 gives 


the descriptive statistics of annual PFD data of Giri and Jalal rivers. 


Table 1: Descriptive statistics of annual PFD of Giri and Jalal rivers 



































PED series Statistical parameters 

x (m/s) | Sx(m%¥/s) | CV (%) Cs Cx 
Giri river 940.4 918.2 97.6 1.841 2.919 
Jalal river 140.3 86.3 61.5 0.515 -1.010 
Sx: Standard Deviation; Cs: Coefficient of Skewness; Cx: Coefficient of Kurtosis 
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By applying the procedures of EV1 and LP3 distributions, parameters were determined by 


MLM and used for estimation of PFD. Tables 2 and 3 give the PFD estimates with 95% 


lower and upper confidence limits for different return periods adopting EV1 and LP3 


distributions for Giri and Jalal rivers. 


Table 2: PFD estimates with confidence limits using EV1 and LP3 distributions for Giri river 

















































































































Return EVI LP3 
period Xr SE (Xr) | Confidence limits Xr SE (Xr) | Confidence limits 
(year) | (m*/s) | (m?/s) | LCL UCL | (m/s) | (m*/s) | LCL UCL 
2 776.7 145.9 490.7 | 1062.6 661.1 124.2 417.7 904.6 
5 | 1374.3 223.9 935.4 | 1813.1 | 1414.1 230.4 962.5 | 1865.6 
10 | 1769.9 287.2 1207.0 | 2332.9 | 2037.1 330.5 1389.2 | 2684.9 
15 | 1993.2 324.7 1356.9 | 2629.5 | 2418.8 394.0 1646.6 | 3191.0 
20 | 2149.5 351.4 1460.8 | 2838.1 | 2709.5 442.9 1841.5 | 3577.6 
25 | 2269.9 372.1 1540.5 | 2999.2 | 2936.2 | 481.4 1992.7 | 3879.7 
50 | 2640.8 436.9 1784.4 | 3497.1 | 3671.0 607.4 2480.5 | 4861.5 
75 | 2856.3 475.0 1925.3 | 3787.4 | 4150.5 690.3 2797.6 | 5503.4 
100 | 3008.9 502.1 2024.7 | 3993.1 | 4470.7 746.1 3008.4 | 5933.0 
Table 3: PFD estimates with confidence limits using EV1 and LP3 distributions for Jalal river 
Return EVI LP3 
period Xr SE (Xr) | Confidence limits Xt SE (Xr) | Confidence limits 
(year) | (m*/s) | (m/s) | LCL UCL | (m/s) | (m?s) | LCL UCL 
2 | 125.1 19.5 86.9 163.3 123.1 19.2 85.6 160.7 
5| 200.3 29.9 141.7 258.8 206.4 30.8 146.0 266.7 
10 | 250.1 38.3 174.9 325.2 259.4 39.8 181.5 337.3 
15| 278.2 43.3 193.2 363.1 287.5 44.8 199.8 375.3 
20 | 297.8 46.9 205.9 389.7 307.1 48.4 212.4 401.9 
25| 313.0 49.7 215.6 410.3 321.5 51.0 221.5 421.5 
50 | 359.6 58.3 245.3 473.9 363.8 59.0 248.2 479.4 
75 | 386.7 63.4 262.5 511.0 388.3 63.6 263.6 513.1 
100 | 405.9 67.0 274.6 537.3 406.6 67.1 275.1 538.1 





























From Tables 2 and 3, it may be noted that the PFD estimates given by LP3 are consistently 


higher than the corresponding values of EV 1 for return period above 2-year. The plots of 


recorded and estimated PFD using EV1 and LP3 distributions with 95% confidence limits for 


Giri and Jalal rivers are presented in Figures 1 to 4. These figures showed that the recorded 


PFD data are falling between the confidence limits of the estimated PFD obtained from EV 1 


and LP3 distributions for Giri and Jalal rivers. 
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Figure 1: Plots of recorded and estimated PFD using EV1 


distribution with 95% confidence limits for Giri river 
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Figure 2: Plots of recorded and estimated PFD using LP3 


distribution with 95% confidence limits for Giri river 
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Figure 3: Plots of recorded and estimated PFD using EV 1 
distribution with 95% confidence limits for Jalal river 
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Figure 4: Plots of recorded and estimated PFD using LP3 


distribution with 95% confidence limits for Jalal river 
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Analysis Based on GoF Tests 


The adequacy of fitting of EV1 and LP3 distributions for frequency analysis of PFD data was 
performed by adopting GoF tests (A? and 1) and the results are presented in Table 4. 


Table 4: Computed values of GoF tests using EV1 and LP3 distributions 
PED series EVI LP3 
AZ X AZ a 











Giri river 0.673 1.333 0.329 2.333 
Jalal river 0.329 0.500 0.236 0.500 


























From Table 4, it may be noted that the computed values are not greater than the theoretical 
values (0.757 for A’ and 5.99 for 1°) at 5% significance level, and at this level, the GoF tests 


results supported the EV1 and LP3 distributions for modelling the series of annual PFD. 
Analysis Based on Diagnostic Test 
The selection of most suitable probability distribution amongst EV1 and LP3 distributions 


was made through D-index and the results are presented in Table 5. 


Table 5: D-index values of EV1 and LP3 distributions 











PFD series EVI LP3 
Giri river 2.974 2.405 
Jalal river 1.227 0.975 

















From Table 5, it may be noted that the D-index values of LP3 distribution are found to be 
minimum when compared to the corresponding values of EV1 and hence LP3 distribution 
was identified as better suited probability distribution for estimation of PFD. 


Conclusions 


The paper presents the study carried out for frequency analysis of annual PFD for Giri and 


Jalal rivers adopting EV 1 and LP3 distributions (using MLM). GoF tests such as A’ and vA 
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were applied for checking the adequacy of fitting of probability distributions to the recorded 

annual PFD data. The selection of most suitable probability distribution was made through D- 

index. The following conclusions are drawn from the study: 

i) From the frequency curves, it was found that the recorded annual PFD data are falling 
between the confidence limits of the estimated PFD from EV1 and LP3 distributions. 

ii) The GoF test results supported the EV1 and LP3 distributions for modelling the series of 
annual PFD data. 

iii) Based on D-index values, the LP3 distribution was identified as better suited distribution 
for estimation of PFD. 

iv) The study suggested that the upper confidence limit of 100-year return period estimated 
PED of about 5993 m°/s for Giri river and 538 m°/s for Jalal river from LP3 distribution 
(using MLM) could be used for design of hydraulic structures in the rivers. 

v) The study also suggested that the results presented in the paper would be beneficial for 


stake holders for arriving at a design flood in river protection works of YRB. 
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